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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/599, 087 



DATE: 
TIME: 



07/05/2000 
18 : 53 : 55 



Input Set : 
Output Set: 



A:\00450seq.txt 
N:\CRF3\07052000\I599087 .raw 



<110> APPLICANT: Polverino, Anthony J. 
Luethy, Roland 

<120> TITLE OF INVENTION: Secreted Epithelial Colon Stromal-1 Molecules and Uses 
Thereof 

<130> FILE REFERENCE: 00-450 

<140> CURRENT APPLICATION NUMBER r US/09/5 99,0 87 
<141> CURRENT FILING DATE: 2000-06-21 

NUMBER OF SEQ ID NOS : 21 
SOFTWARE: PatentlnVer. 2.0 
SEQ ID NO: 1 
LENGTH: 744 
TYPE: DNA 

Mus musculus 




<160> 
<170> 
<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<222> 
<400> 



ORGANISM 
FEATURE : 
NAME/KEY 
LOCATION 
SEQUENCE 



CDS 
(38). . 
1 



(274) 



aga 
Arg 



gcttcctccc taggcgtgag actccggctc cttcact atg 

Met 
1 

ttc tgc 
Phe Cys 



tec ggt 
Ser Gly 



ctg 
Leu 



gaa ggg 
Glu Gly 



cac 
His 

agg 
Arg 
55 
gtg 
Val 



eta 
Leu 
40 
ccc 
Pro 



etc 
Leu 
10 

aga aga 
Arg Arg 

25 
tct 
Ser 



tgc atg ctg 
Cys Met Leu 



etc 
Leu 



cct 
Pro 



cat 
His 

aga 
Arg 



cct gec 
Pro Ala 



tgc 
Cys 



aga etc 
Arg Leu 



tec 
Ser 

tgc 
Cys 
60 
cca 



aag 
Lys 
45 
aga 
Arg 



aag 
Lys 
30 
ctg 
Leu 

aac 
Asn 



etc 
Leu 
15 
tec 
Ser 



tgt 
Cys 



ttg 
Leu 



aaa 
Lys 



etc 
Leu 



aca acc tgg aaa 
Thr Thr Trp Lys 
50 

aag eta cca gtc 
Lys Leu Pro Val 
65 



ctt 
Leu 

att 
He 

agg 
Arg 
35 

gga 

Gly 

aag 
Lys 



eta 
Leu 

ttc 
Phe 
20 
cgc 
Arg 

aac 
Asn 

tea 
Ser 



gec ctt 
Ala Leu 
5 

tec tea 
Ser Ser 



tgc 
Cys 

cac 
His 



tgt 
Cys 

aca 
Thr 



tgg gtg 
Trp Val 
70 



cct ggg get etc 
Pro Gly Ala Leu Pro Gin He 
75 

agcgttgatg cccagatgtg 
gecaatgage tgtgacactg 



cag ata tag ggcctcctcc geccagatga 



gagacaccag 
gaatgettea 
aggtgtcacc 
cctccagagt 
ctgactagag 
gaaccctcag 
taacttccta 



attccaagtc ctcattccaa 
gcccacaagt caccatggct 
cagatccaga gactttcaag 
ccgagactac agcatcttgg 
ggtctccaaa ccaacacctg 
taaaccataa atcagcatac acactaaaaa 
<210> SEQ ID NO: 2 
<211> LENGTH: 78 
<212> TYPE : PRT 
<213> ORGANISM: Mus musculus 



aagcatacac actatgttgc cttgcccctt 
cttcagacat cagggeggat ggattgeaga 
aaccttcaga gtcactaagg tccaggctca 
aaaagtccaa gattccacct gtgggagcta 
tgcagagaag caagacctca gtgtgatcag 
tcagccccaa acccctaaca cttaaccact 
atgaaatcat caggaggata ccaaaagaaa 



55 



103 



151 



199 



247 



294 



354 
414 
474 
534 
594 
654 
714 
744 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/599,087 



DATE: 
TIME: 



07/05/2000 
18:53:55 



Input Set : 
Output Set : 



A:\00450seq.txt 
N:\CRF3\07052000\I599087.raw 



74 <400> SEQUENCE: 2 



75 


Met 


Arg 


Leu 


Leu 


Ala 


Leu 


Ser Gly 


Leu 


Leu 


Cys 


Met 


Leu 


Leu 


Leu 


Cys 


76 


1 








5 








10 










15 




78 


Phe 


Cys 


lie 


Phe 


Ser 


Ser 


Glu Gly 


Arg 


Arg 


His 


Pro 


Ala 


Lys 


Ser 


Leu 


79 








20 








25 










30 






81 


Lys 


Leu 


Arg 


Arg 


Cys 


Cys 


His Leu 


Ser 


Pro 


Arg 


Ser 


Lys 


Leu 


Thr 


Thr 


82 






35 








40 










45 








84 


Trp 


Lys 


Gly 


Asn 


His 


Thr 


Arg Pro 


Cys 


Arg 


Leu 


Cys 


Arg 


Asn 


Lys 


Leu 


85 




50 










55 








60 










87 


Pro 


Val 


Lys 


Ser 


Trp 


Val 


Val Pro 


Gly 


Ala 


Leu 


Pro 


Gin 


He 






88 


65 










70 








75 













91 <210> SEQ ID NO: 3 

92 <211> LENGTH: 54 

93 <212> TYPE: PRT 

94 <213> ORGANISM: Mus musculus 

96 <4 00> SEQUENCE: 3 

97 Arg Arg His Pro Ala Lys Ser Leu Lys Leu 



10 



Arg Arg Cys Cys His Leu 
15 

Ser Pro Arg Ser Lys Leu Thr Thr Trp Lys Gly Asn His Thr Arg Pro 
20 25 30 

103 Cys Arg Leu Cys Arg Asn Lys Leu Pro Val Lys Ser Trp Val Val Pro 

45 



98 

100 

101 



104 



35 



40 



106 Gly Ala Leu Pro Gin He 



107 



50 



110 <210> SEQ ID NO: 4 

111 <211> LENGTH: 806 



112 <212> TYPE: DNA 

113 <213> ORGANISM: 
115 <2 20> FEATURE: 



Homo sapiens 



116 
117 



<221> 
<222> 



NAME/KEY 
LOCATION 



119 <400> SEQUENCE 



CDS 

(29). 

4 



- (274) 



120 ggaacgaggg aaaatctgcc ttctcacc atg agg 



121 
122 



Met Arg 
1 



124 ctg etc tgt ate ctg ctt etc tgc ttc tec 

125 Leu Leu Cys He Leu Leu Leu Cys Phe Ser 

126 10 15 

128 aag agg cgt cct gec aag gec tgg tea ggc 

129 Lys Arg Arg Pro Ala Lys Ala Trp Ser Gly 

130 25 30 

132 tgc cac cga gtc cct age ccc aac tea aca 
Pro Ser Pro Asn Ser Thr 

45 50 

136 gtg agg etc tgt aaa cca tgc aag ctt gag 

137 Val Arg Leu Cys Lys Pro Cys Lys Leu Glu 

138 60 65 

140 gtg gtg cct ggg gca etc cca cag gtg tag 

141 Val Val Pro Gly Ala Leu Pro Gin Val 



133 Cys His Arg Val 
134 



ctt eta gtc ctt tec age 
Leu Leu Val Leu Ser Ser 
5 

ate ttc tec aca gaa ggg 
He Phe Ser Thr Glu Gly 
20 

agg aga acc agg etc tgc 
Arg Arg Thr Arg Leu Cys 
35 40 
aac ctg aaa gga cat cat 
Asn Leu Lys Gly His His 
55 

cca gag ccc cgc ctt tgg 
Pro Glu Pro Arg Leu Trp 
70 

cactcccaaa gcaagactcc 



52 



100 



148 



196 



244 



294 
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RAW SEQUENCE LISTING DATE : 07/05/2000 

PATENT APPLICATION : US/09/599 , 0 87 TIME : 18:53:55 



Input Set : A:\00450seq.txt 

Output Set: N:\CRF3\07052000\l599087.raw 

142 75 80 

144 agacagcgga gaacctcatg cctggcacct gaggtaccca gcagcctcct gtctcccctt 354 

146 tcagccttca cagcagtgag ctgcaatgtt ggagggcttc atctcgggct gcaaggaccc 414 

148 tgggaaagtt ccagaactcc acgtccttgt ctcaattgtg ccatcaactt tcagagctat 474 

150 catgagccaa cctcacccca cagggcctca gtcgccacca tgtgggcctc tccagtgcaa 534 

152 accaccgagc attccaccat gaccggtcac agctacaaat ccagagacca tcaatcctgc 594 

154 tagagtgcag ggtggcaagc acccaagggt ggctgaccaa gactgcagag tctcctccat 654 

156 cttcaggtcc attcagcctc ctggcattta actaccagca tccagtggtc cccaaggaat 714 

158 cccttcctag cctcctgaca tgagtctgct ggaaagagca tccaaacaaa caagtaataa 774 

160 ataaataaat aaactcaatg cagacacaaa aa 806 

163 <210> SEQ ID NO: 5 

164 <211> LENGTH: 81 

165 <212> TYPE: PRT 

166 <213> ORGANISM: Homo sapiens 
168 <400> SEQUENCE: 5 



169 


Met 


Arg 


Leu 


Leu 


Val 


Leu 


Ser 


Ser 


Leu 


Leu 


Cys He 


Leu 


Leu 


Leu 


Cys 


170 


1 








5 










10 








15 




172 


Phe 


Ser 


He 


Phe 


Ser 


Thr 


Glu 


Gly 


Lys 


Arg 


Arg Pro 


Ala 


Lys 


Ala 


Trp 


173 








20 










25 








30 






175 


Ser 


Gly 


Arg 


Arg 


Thr 


Arg 


Leu 


Cys 


Cys 


His 


Arg Val 


Pro 


Ser 


Pro 


Asn 


176 






35 










40 








45 








178 


Ser 


Thr 


Asn 


Leu 


Lys 


Gly 


His 


His 


Val 


Arg 


Leu Cys 


Lys 


Pro 


Cys 


Lys 


179 




50 










55 








60 










181 


Leu 


Glu 


Pro 


Glu 


Pro 


Arg 


Leu 


Trp 


Val 


Val 


Pro Gly Ala 


Leu 


Pro 


Gin 


182 


65 










70 










75 








80 



184 Val 

187 <210> SEQ ID NO: 6 

188 <211> LENGTH: 57 

189 <212> TYPE: PRT 

190 <213> ORGANISM: Homo sapiens 

192 <400> SEQUENCE: 6 

193 Lys Arg Arg Pro Ala Lys Ala Trp Ser Gly Arg Arg Thr Arg Leu Cys 

194 "l 5 10 15 

196 Cys His Arg Val Pro Ser Pro Asn Ser Thr Asn Leu Lys Gly His His 

197 20 25 30 

199 Val Arg Leu Cys Lys Pro Cys Lys Leu Glu Pro Glu Pro Arg Leu Trp 

200 35 40 45 

202 Val Val Pro Gly Ala Leu Pro Gin Val 

203 50 55 

206 <210> SEQ ID NO: 7 

207 <211> LENGTH : 77 

208 <212> TYPE: PRT 

209 <213> ORGANISM: Rattus norvegicus 

211 <400> SEQUENCE: 7 

212 Met Arg Leu Leu Thr Leu Ser Gly Leu Phe Phe Met Leu Phe Leu Cys 

213 15 10 15 

215 Leu Cys Val Leu Ser Ser Glu Gly Arg Lys Arg Pro Ala Lys Phe Pro 

216 20 25 30 

218 Lys Leu Arg Pro Arg Cys His Leu Ser Pro Arg Ser Lys Pro He Thr 
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w--> 



219 
221 
222 
224 
225 
228 
229 
230 
231 
233 
234 
235 
237 
238 
239 
241 
242 
243 
245 
246 
247 
249 
250 
251 
253 
254 
256 
258 
260 
262 
264 
266 
268 
270 
272 
274 
276 
278 
280 
282 
284 
286 
288 
290 
292 
294 
296 
298 
300 



Homo sapiens 



RAW SEQUENCE LISTING DATE: 07/05/2 000 

PATENT APPLICATION: US/09/599,087 TIME: 18:53:55 

Input Set : A:\00450seq.txt 

Output Set: N:\CRF3\07052000\I599087.raw 

35 40 45 

Trp Lys Gly Asn His Thr Arg Pro Cys Arg Pro Cys Arg Lys Leu Glu 

50 55 60 

Ser Asn Ser Trp Val Val Pro Gly Ala Leu Pro Gin lie 

65 70 75 

<210> SEQ ID NO: 8 
<211> LENGTH: 4159 
<212> TYPE: DNA 
<213> ORGANISM: 
<220> FEATURE: 
<221> NAME/KEY: 
<222> LOCATION: 
<220> FEATURE: 
<221> NAME/KEY: 
<222> LOCATION: 
<220> FEATURE: 
<221> NAME/KEY: 
<222> LOCATION: 
<220> FEATURE: 
<221> NAME/KEY: 
<222> LOCATION: 
<220> FEATURE: 
<221> NAME/KEY: 
<222> LOCATION: 
<400> SEQUENCE: 8 

atg agg ctt eta gtc ctt tec age ctg etc tgt ate ctg ctt etc tgc 
ttc tec ate ttc tec aca gaa ggtagggcag cccccagggt gcagatccct 
gagcaggatt tcagcatctg ggaagactct gatcaggatt tgttggaggg caggecttgg 



unsure 
(160). . 

unsure 
(3884) . 



(169) 



. (3893) 



exon 
(1). . 



(69) 



exon 
(2627) . 

exon 
(4079) . 



. (2725) 



. (4159) 



nnnnnnnnnn, cgcgcgtact tccagccccg tggtgaagac gaaagagggc tctttctcct 

gaacctatag gtttggggct caggactgcc tgcaggtggc ttgggggttc cattcacagc 
ccctgcaccc ccaaatacat acccagccta agtaaagtgg tgtgttcgcc atgeaaacac 
acatacaacc tctcagctag attactgtgc ttaagtccta cctatctaga atttctggag 
ccattctctt gtacttgtgt catgcttgga acagagtaaa ttagtgttgg gcaaatgaat 
acattaatta gtagaccatc taagtctgaa catcccaaaa cctcatgccc agaaaatatc 
catgagcagc tgaaatgaag gtgtgtgtgg tagggaggtg gggtatgttt atgcatgttt 
agaaggggac accatctttt tacctctata gatatgaata tttagctctc ttgccctttt 
ttcttttttc tttttttttt tttttttgag atggagtctt gctctgtcac ccaggctgga 
gtgcagtggc gctatctcag ctcactgcaa tctccgcctc ctgggttcaa gcaattctct 
gcctcagcct cccaagtagc tgagattaca ggtgcccacc accaagccca gctaattttt 
gtatttttag tacagacagg tttcaccatc ttggccaggc tggtcttgaa ctcctaacct 
cgtaatcctc ccacctcggc ctcccaaagt gctgggatta caggegtgag ccaccatgcc 
tggctgeett tcttgattca gatagctgag tgtttcaatc catttttctc ttgtctaacc 
ctctagaaac tgectacatt tattttttgt tttagtggtt atggttactc aaacttttgg 
gtggggggag ctggagctat agaaatatat aaagagaaga aaaacactca attccatgat 
tcaagagtag ccatgttcaa cattttgttt atttccttgc atgtagaatt tttaaaaatt 
aattgatgta cctatatgtt caaggttata tcttttttat ttatcactat atatattgtt 
ataatcaccc aaaatgetta tgattgaaga tattctggaa gcatttacaa cccagtgtca 
gcagcagcca tctctgagta gtgggattat aacaagtgtt tgttttacaa agtttctgcg 
atgaaaatgt cccacatata taataaggaa aacagtgatt agaattcctc ataaacacag 



48 
99 
159 
219 
279 
339 
399 
459 
519 
579 
639 
699 
759 
819 
879 
939 
999 
1059 
1119 
1179 
1239 
1299 
1359 
1419 



I 



Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> fields of each sequence which presents at least one n or Xaa. 
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RAW SEQUENCE LISTING DATE : 07/05/2000 

PATENT APPLICATION: US/09/599,087 TIME: 18:53:55 

Input Set : A:\00450seq.txt 

Output Set: N:\CRF3\07052000\I599087.raw 

302 cccgtgacat gcaatttatc agacctctat ttttggacat gttggaggtt gccagtgata 1479 
304 ccctagtgac aattaaatga ggatagatac cttcccccat aaagtttcct atccatttag 1539 
306 gactatctgt agcaaactct tgaagtagca ttaatcaact aatattttca ggtataactt 1599 
308 gctacaagtg aacgtactat gatgaattta catgcttaga catttagata gttcacaatt 1659 
310 gtgtgctttt ccttttttga agcaagatct tgctctcttg cccaggtcgg agtgcagtgg 1719 
312 catgaccacg gctcagtgca ggcttgactt ccagggctca agcaatactc gcacctcagg 1779 
314 ttttccagta gctgggaaaa caggtgcgca ccacaatgcc ctgctaattt ttaaaatttt 1839 
316 ttgcagagac gaggtctctc taagttgccc aggctggtct tgaacttctg gactcaagcc 1899 
318 atcctcccac cttggcctcc cagagtgcta ggatcacagg catgagccac cacacctggc 1959 
320 ctactttgca cattttaatt atgtggtaaa aggtatatat gtacataaag tatgtccttt 2019 
322 attcaggctt tttttctttt tttctttttt ttattttttt gagacgaagt ttttgctctt 2079 
324 gttgtccagg ctggagtgta atggcatgct cttggctcac cacaacctcc gcctcccggg 2139 
326 ttcaagtgat tctcctgcct caacctcctg agtagctggg attacaggca tgcaccaaca 2199 
328 tgccaggctg attttgtatt tttagtagag atggggtttc tccatgttgg tcaggctggt 2259 
330 ctcgaacact cgacctcaag tgatccgccc acctcagcct cccaaagagc taggattaca 2319 
332 ggcatgagcc accacaccca gctcagggct tattttctta ggctagattg ccaaggggag 2379 
334 aattattatg tcaaagaaac tacttattgg acaggaatct gaaaaggatg tgttttgggg 2439 
336 ccatgtgtct cccaacattg ttatttctga aaagtaaatc acaacaaggc ccactctttc 2499 
338 cctaggacct ctcgtagcct ggctcatcct gagtttctct ggataaatat tcctgagccc 2559 
340 tgtgccttgg aaggggaagc tcactcacag acaagcccac taaagacagt ctctcttcct 2619 
342 ttgtgtc cac cct cag gga aga ggc gtc ctg cca agg cct ggt cag gca 2668 
344 gga gaa cca ggc tct get gec acc gag tec eta gee cca act caa caa 2716 
346 acc tga aag gtaagtaccc ccacctcgtc cagactgtgg ggcagaagtt 2765 
348 ctacagtggc catgggacca gccacacaca ctgatcagcc cccacccatg gctggcatca 2825 
350 ggctctggct gggaggacat ctttgttttg ttgattaatt tgttgactcc cccccaaaag 2885 
3 52 tcaacaaatt aatcatttta aactgaatac attctgecat ggaaaaaaag caggatgeaa 2945 
3 54 ttagcagatg ttgtgtggaa acacacttac tttaggtgga aggtgtctga gcaggtgaca 3005 
356 tttatgagac ctggctcatt tatgagecag gagectgget gaggcctgtg gaggtggggc 3065 
358 atgcaggcag aggaggcagc aagggtgaag ggcaagagtg gggtatggaa gacagatggt 3125 
360 agcagggctt gagaggtact cccagaagct aaggaccaaa gctgcctgtg aaccctgtgg 3185 
362 acctggggca cagatcagca tgcaggtcac cagcagggga gtgggcctga gggtccagag 3245 
364 agecataget tggcaggaga taaggcagee ecagagatge cagcaggcag catccaggct 330 5 
366 gcatgaccag aacgaggccc agaagagcaa ggctgccctc tccctgaggc ctggggacac 336 5 
368 tgggaggect gtggcggaca ggcccaagct caggagggct gcgggcaccc agttccctgc 3425 
370 acaggggctg caggcccaga gcagatattc actggagttg cccagcccag gtggaagggt 3485 
372 caggctgetg gagcttgggt agggcaggca gatccccaag gggagactgt ggaccctgag 354 5 
374 tcagacagcc tgacaccaac ctggggctcc tgectgaact ctgcagcccc agtgcccact 3605 
376 ctcaagaggc tgaggaggtc ccggccccac ttgctcctct gcggccatgg cccatggggt 3665 
378 ccatgaccag cgccggagcc tccatgcctt tcccagctac caaggggatg ctcagctgtg 3725 
380 atgeaggaga gggatagagg gaggaagcaa gacagcatga ctccagccgc agaccttctc 3785 
382 ccggagatgc tgacagccct ttcttccaaa ctggcatcac acccagccgg ccaggataaa 384 5 
W--> 3 84 aataaccagc tcgtcttcac caegggctga aggatccenn nnnnnnnnca cgaaaagccc 3 905 
386 cttctgggcc tccagggaaa agcataagat ctaattcttg ctttgaaatt tttttttaaa 3965 
388 tgtgtttgaa aatgeaaett aattgtgttt tcctctctct ccccacaacc tggctctgac 402 5 
390 ctcgccatct tcctgtcctt gtccctcttg tctactcatt gctcctccca gga cat 4081 
392 cat gtg agg etc tgt aaa cca tgc aag ctt gag cca gag ccc cgc ctt 4129 
394 tgg gtg gtg cct ggg gca etc cca cag gtg 4159 

397 <210> SEQ ID NO: 9 

398 <211> LENGTH: 23 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/599 , 087 



DATE: 07/05/2000 
TIME: 18:53:56 



Input Set : A:\00450seq.txt 

Output Set: N:\CRF3\07052000\I599087, raw 



L:ll M:270 C: Current Application Number differs, Replaced Application Number 
L:12 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:260 M:258 W: Mandatory Feature missing, <223> not found for SEQ ID# : 8 
L:260 M:340 W: (46) "n n or "Xaa" used: Feature required, for SEQ ID# : 8 
L:384 M:258 W: Mandatory Feature missing, <223> not found for SEQ ID# : 8 
M:340 Repeated in SeqNo=8 
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